CRaC may exit before image dump is completed #46

Closed
wants to merge 6 commits into from

Conversation

wkia

@wkia wkia commented Feb 21, 2023

When CRaC runs under Docker, java may exit before CRIU has finished dumping, because CRIU kills the original java process and Docker then exits immediately.

It can be reproduced with a simple Java test:

public class Test {
    public static void main(String[] args) throws Exception {
        // Request a checkpoint; execution resumes from here after restore.
        jdk.crac.Core.checkpointRestore();
        System.out.println("finish");
    }
}

and run with docker:
docker <docker_options> $JAVA_HOME/java -XX:CRaCCheckpointTo=./cr Test.java

When the failure occurs, cr/cppath is absent after the command above finishes, and/or the subsequent restore fails:
docker <docker_options> $JAVA_HOME/java -XX:CRaCRestoreFrom=./cr

This change fixes the issue by forking the main process when its PID is 1 (PID 1 indicates the process is the container's init process, as happens when run under Docker) and waiting until all child processes have finished. This ensures that CRIU has finalized the dump, if any. At the same time, there is no PID conflict on restore, since the process being restored has a PID other than 1 when restoring with the command above.
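
For readers who want to see the shape of the idea, here is a minimal C sketch of "fork when PID is 1, then wait for all children". It is not the code from this PR: the function names `fork_if_pid1` and `wait_for_all_children` are hypothetical, and the exit-code mapping is an assumption. It relies on standard Linux behaviour that orphaned processes in a PID namespace are reparented to that namespace's PID 1, so a `wait()` loop in PID 1 also covers a CRIU process that outlives the java process.

```c
#include <errno.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/*
 * Block until the calling process has no children left.
 * Returns the exit code of main_child, 128 + signal number if it was
 * killed by a signal, or 1 if its status was never observed.
 */
int wait_for_all_children(pid_t main_child) {
    int exit_code = 1;
    for (;;) {
        int status;
        pid_t pid = wait(&status);
        if (pid == -1) {
            if (errno == EINTR) {
                continue;   /* interrupted by a signal: retry */
            }
            break;          /* ECHILD: no children remain */
        }
        if (pid == main_child) {
            if (WIFEXITED(status)) {
                exit_code = WEXITSTATUS(status);
            } else if (WIFSIGNALED(status)) {
                exit_code = 128 + WTERMSIG(status);
            }
        }
    }
    return exit_code;
}

/*
 * If the caller is PID 1, fork. The child returns and carries on as the
 * real program (with a PID other than 1). The parent stays behind as PID 1,
 * waits for every child, including reparented orphans, and then exits.
 * If the caller is not PID 1, this returns immediately.
 */
void fork_if_pid1(void) {
    if (getpid() != 1) {
        return;
    }
    pid_t child = fork();
    if (child < 0) {
        perror("fork");
        exit(1);
    }
    if (child == 0) {
        return;             /* child: continue normal startup */
    }
    exit(wait_for_all_children(child));
}
```

In this sketch a launcher would call `fork_if_pid1()` at the very start of `main()`, before any JVM state exists, so the process that is later checkpointed never holds PID 1 and the container stays alive until the dump is complete.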


Progress

  • Change must not contain extraneous whitespace

Reviewers

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/crac pull/46/head:pull/46
$ git checkout pull/46

Update a local copy of the PR:
$ git checkout pull/46
$ git pull https://git.openjdk.org/crac pull/46/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 46

View PR using the GUI difftool:
$ git pr show -t 46

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/crac/pull/46.diff

@bridgekeeper

bridgekeeper bot commented Feb 21, 2023

👋 Welcome back rmarchenko! A progress list of the required criteria for merging this PR into crac will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

@openjdk

openjdk bot commented Feb 21, 2023

@wkia This change now passes all automated pre-integration checks.

ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details.

After integration, the commit message for the final commit will be:

CRaC may exit before image dump is completed

Reviewed-by: akozlov

You can use pull request commands such as /summary, /contributor and /issue to adjust it as needed.

At the time when this comment was updated there had been 4 new commits pushed to the crac branch:

  • dff32bc: Document CRaCCheckpointTo and CRaCRestoreFrom in java(1) man page
  • 80cab69: Fix failing CRaC tests
  • 0738da8: Close extraneous file descriptors
  • 7dfaf5f: DNS cache maintenance

Please see this link for an up-to-date comparison between the source branch of this pull request and the crac branch.
As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the /integrate command for further details.

As you do not have Committer status in this project an existing Committer must agree to sponsor your change. Possible candidates are the reviewers of this PR (@AntonKozlov) but any other Committer may sponsor as well.

➡️ To flag this PR as ready for integration with the above commit message, type /integrate in a new comment. (Afterwards, your sponsor types /sponsor in a new comment to perform the integration).

@openjdk openjdk bot added the ready (Pull request is ready to be integrated) and rfr (Pull request is ready for review) labels Feb 21, 2023
@mlbridge

mlbridge bot commented Feb 21, 2023

Webrevs

@wkia wkia requested a review from AntonKozlov March 1, 2023 16:30
@wkia wkia requested a review from AntonKozlov March 7, 2023 09:37
@rvansa rvansa mentioned this pull request Mar 7, 2023
1 task
Member

@AntonKozlov AntonKozlov left a comment

LGTM, thanks!

@wkia
Author

wkia commented Mar 14, 2023

/integrate

@openjdk openjdk bot added the sponsor (Pull request is ready to be sponsored) label Mar 14, 2023
@openjdk

openjdk bot commented Mar 14, 2023

@wkia
Your change (at version c117d07) is now ready to be sponsored by a Committer.

@AntonKozlov
Member

/sponsor

@openjdk

openjdk bot commented Mar 15, 2023

Going to push as commit a11b46a.
Since your change was applied there have been 4 commits pushed to the crac branch:

  • dff32bc: Document CRaCCheckpointTo and CRaCRestoreFrom in java(1) man page
  • 80cab69: Fix failing CRaC tests
  • 0738da8: Close extraneous file descriptors
  • 7dfaf5f: DNS cache maintenance

Your commit was automatically rebased without conflicts.

@openjdk openjdk bot added the integrated (Pull request has been integrated) label Mar 15, 2023
@openjdk openjdk bot closed this Mar 15, 2023
@openjdk openjdk bot removed the ready (Pull request is ready to be integrated), rfr (Pull request is ready for review) and sponsor (Pull request is ready to be sponsored) labels Mar 15, 2023
@openjdk

openjdk bot commented Mar 15, 2023

@AntonKozlov @wkia Pushed as commit a11b46a.

💡 You may see a message that your pull request was closed with unmerged commits. This can be safely ignored.

@wkia wkia deleted the crac_docker_fix branch March 17, 2023 09:24
@wkia wkia mentioned this pull request Mar 20, 2023
1 task
Labels
integrated Pull request has been integrated
3 participants